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(54) Information filtering method and device, and storage medium storing information filtering 
program 



(57) In an information filtering method, attributes 
included in information items are extracted and stored, 
and ratings relative to the information items carried out 
by users are stored. The users include a subject user 
and other users. A relationship between the ratings rel- 
ative to the information items rated by the subject user 
and the attributes thereof and a relationship between 
the ratings relative to the information items rated by the 
other users and the attributes thereof are utilized for 
estimating relevances to the subject user of the informa- 
tion items not rated by the subject user. The estimated 
relevances are used to carry out recommendation or f il- 
tering-in of the information item which matches with the 
subject user. 
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Description 

Background of the Invention: 

The present invention relates to an information fil- 
tering method, an information filtering device and a stor- 
age medium storing an information filtering program, for 
carrying out recommendation or filtering of information 
according to a user's interest or taste from an informa- 
tion system, such as database or the WWW (world wide 
web). 

Information filtering methods and devices, and 
information filtering programs (hereinafter collectively 
referred to as "information filtering systems") have been 
used to carry out recommendation orfiltering-in of infor- 
mation or services matching with interests or tastes of 
users from among a large amount of information or a 
large number of services (hereinafter collectively 
referred to as "information") offered by an information 
system, such as database, personal computer commu- 
nication or the WWW. 

In the conventional information filtering systems, 
there have been known two kinds of systems, that is, 
the CBF (content based filtering) system and the SIF 
(social information filtering) system. 

In the CBF system, the contents of items of infor- 
mation are featured by data, such as keywords or fre- 
quencies of words contained in the respective 
information items (hereinafter such data will be referred 
to as "attributes"), and the information filtering is per- 
formed based on comparison between the foregoing 
featured contents and interests of a user shown in the 
past and represented by weights of keywords or the like. 

On the other hand, in the SIF system, data about 
ratings of information items carried out by users (herein- 
after such data will be referred to as "ratings") are 
stored, and the information filtering is performed by 
comparing the stored ratings of a subject user (a user 
requesting the information filtering) and the stored rat- 
ings of the users other than the subject user to find the 
user/users having similar taste with the subject user and 
by selecting the information item which has been evalu- 
ated high by the foregoing similar user/users, but not yet 
evaluated by the subject user. 

Japanese First (unexamined) Patent Publication 
No. 3-94375 describes a document retrieval device as 
an example of the CBF system. On the other hand, as 
examples of the SIF system, a system called Grou- 
pLens is described in "proceedings of the cscw 1994" 
by acm press, 1994, pp. 175-186 and a system called 
Ringo is described in "proceedings of the chi95" by acm 
press, 1995, pp. 210-217. 

Each of the CBF and the SIF will far later be 
described in detail in conjunction with the drawing. 

In the CBF system, a relationship between 
attributes of information items, such as word frequen- 
cies, and ratings performed by the subject user is 
learned relative to information items which have been 



rated by the subject user, so as to find (filter in) informa- 
tion items which have not yet been rated by the subject 
user. On the other hand, in the foregoing SIF system, a 
relationship between the subject user and the users 

5 Other than the subject user is learned from ratings car- 
ried out by the subject user and the other users relative 
to the information items so as to filter the information 
items which have not been rated by the subject user, but 
rated by the other users. 

10 Specifically the first problem of the CBF system 
resides in that, although information about what the 
user is interested in (that is, attributes such as key- 
words) can be obtained, it is necessary for the user to 
judge whether the information is valuable or not, by 

15 reading it. The reason is that the CBF system is a sys- 
tem based on the attributes of keywords or word fre- 
quencies and not based on personal evaluations as the 
SIF system. 

The second problem of the CBF system resides in 

20 that the filtering accuracy is low relative to an informa- 
tion item having an unknown word or an unknown 
attribute value. The reason is that, in the CBF system, 
the relationship between the ratings by the subject user 
and the attributes of the information items is learned as 

25 the profile based on the rating history of the subject user 
and the attributes of the information items already rated 
by the subject user, and the relevances are estimated 
based on the learned profile. Accordingly in the CBF 
system, if the information item has an unknown word or 

30 an unknown attribute value, how to reflect it upon the 
relevance is unknown. 

On the other hand, the first problem of the SIF sys- 
tem resides in that only the information item that has 
been recommended or evaluated by the users can be 

35 obtained. The reason is that the SIF system is a system 
based on the recommendation or evaluation of the 
users so that only such a recommended or evaluated 
information item can be the subject of the filtering. 

The second problem of the SIF system resides in 

40 that the filtering accuracy is low unless a certain number 
of ratings have been collected. The reason is that the 
SIF system is a system based on the recommendation 
or evaluation of the users. 

45 Summary of the Invention: 

It is therefore an object of the present invention to 
provide an information filtering system which is capable 
of reducing or eliminating the foregoing problems inher- 
50 ent in the conventional CBF and SIF systems and per- 
forming recommendation or filtering-in of information 
with higher filtering accuracy and with more agreement 
to a user's interest or taste. 

Other objects of this invention will become clear as 
55 the description proceeds. 

According to an aspect of the present invention, 
there is provided an information filtering method com- 
prising the steps of obtaining attributes included in infor- 
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mation items, obtaining ratings relative to the 
information items performed by users including a sub- 
ject user and other users, estimating relevances to the 
subject user of the information items not rated by the 
subject user by the use of the ratings relative to the 
information items rated by the subject user, of the 
attributes of the information items rated by the subject 
user, of the ratings relative to the information items 
rated by the other users, and of the attributes of the 
information items rated by the other users, and carrying 
out recommendation or filtering-in for the subject user 
as regards the information items by the use of the rele- 
vances. 

According to another aspect of the present inven- 
tion, there is provided an information filtering method 
comprising first obtaining means for obtaining attributes 
included in information items, second obtaining means 
for obtaining ratings relative to the information items 
performed by users including a subject user and other 
users, estimating means connected to the first and the 
second obtaining means for estimating relevances to 
the subject user of the information items not rated by the 
subject user by the use of the ratings relative to the 
information items rated by the subject user, of the 
attributes of the information items rated by the subject 
user, of the ratings relative to the information items 
rated by the other users, and of the attributes of the 
information items rated by the other users, and carrying 
out means connected to the estimating means for carry- 
ing out recommendation or filtering-in for the subject 
user as regards the information items by the use of the 
relevances. 

According to still another aspect of the present 
invention, there is provided a storage medium storing an 
information filtering program which is executable by a 
computer. The information filtering program allows the 
computer to execute the steps of extracting attributes 
included in information items and storing the attributes 
in a storage device of the computer, and storing ratings 
relative to the information items performed by users 
including a subject user and other users in the storage 
device; using a relationship between the ratings relative 
to the information items rated by the subject user and 
the attributes thereof and a relationship between the rat- 
ings relative to the information items rated by the other 
users and the attributes thereof so as to estimate rele- 
vances to the subject user of the information items not 
rated by the subject user; and using the relevances to 
carry out recommendation or filtering-in of the informa- 
tion item matching with the subject user. 

According to yet another aspect of the present 
invention, there is provided a storage medium storing an 
information filtering program which is executable by a 
computer. The information filtering program allows the 
computer to function as: attribute extracting means for 
extracting attributes included in information items; 
attribute storing means for storing the extracted 
attributes; rating storing means for receiving inputs of 



rating values relative to the information items from users 
including a subject user and other users, and storing the 
received rating values; similarity deriving means for 
deriving, by referring to the rating storing means, simi- 

5 larities between the subject user and the other users 
based on the rating values inputted by the subject user 
relative to the information items and the rating values 
inputted by the other users relative to the information 
items; first relevance estimating means for estimating, 

w based on the derived similarities, first relevances to the 
subject user of the information items not rated by the 
subject user among the information items rated by the 
other users; a user profile learning means for learning, 
by referring to the attribute storing means and the rating 

15 Storing means, a relationship between the attributes 
included in the information items rated by the subject 
user and the ratings thereof, and further learning, based 
on the estimated first relevances, a relationship 
between the attributes included in the information items 

20 not rated by the subject user among the information 
items rated by the other users and the relevances 
thereof to the subject user; second relevance estimating 
means for estimating, based on the relationship 
between the attributes and the ratings and the relation- 

25 ship between the attributes and the relevances, second 
relevances to the subject user of the information items 
not rated by any of the subject user and the other users; 
and relevance unifying means for unifying the first and 
second relevances to carry out recommendation or fil- 

30 tering-in of the information item matching with the sub- 
ject user. 

Brief Description of the Drawing: 

35 Fig. 1 is a block diagram showing a structure of a 
conventional CBF system; 

Fig. 2 is a flowchart showing an operation of the 
conventional CBF system; 

Fig. 3 is a block diagram showing a structure of a 
40 conventional SIF system; 

Fig. 4 is a flowchart showing an operation of the 
conventional SIF system; 

Fig. 5 is a block diagram showing an information fil- 
tering device according to a preferred embodiment 
45 of the present invention; 

Fig. 6 is a flowchart showing an operation of the 
information filtering device shown in Fig. 5; 
Fig. 7 is a block diagram showing in detail a rele- 
vance estimating means of the information filtering 
50 device shown in Fig. 5; 

Fig. 8 is a flowchart showing the operation of the 
information filtering device in more detail according 
to the structure shown in Fig. 7; and 
Fig. 9 is a diagram for explaining information proc- 
55 essed by the information filtering according to the 
preferred embodiment of the present invention. 
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Description of the Preferred Embodiment: 

Now, conventional techniques according to the CBF 
system and the SIF system will be explained with refer- 
ence to Figs. 1 to 4. 

First, the CBF system will be explained using Figs. 
1 and 2. Referring to Fig. 1 , the CBF system comprises 
an input/output device 1, a data processing device 2 
and a storage device 3. 

The input/output device 1 comprises a request 
input section 11, an information display section 12 and 
a rating input section 13. 

The storage device 3 comprises an information 
storing section 31, an attribute storing section 32 and a 
rating storing section 33. The information storing sec- 
tion 31 prestores recorded information 30. The rating 
storing section 33 stores values of ratings performed by 
users relative to items of information, respectively 

The data processing device 2 comprises an 
attribute extracting section 21, a user profile learning 
section 22, a relevance estimating section 23 and an 
information selecting section 24. 

The attribute extracting section 21 extracts 
attributes, which can be features of the information 
items, from the information stored in the information 
storing section 31 and stores them in the attribute stor- 
ing section 32 for the respective information items. As 
explained above, the attribute features the content of 
the information item in terms of keyword data or word 
frequency data. The attribute may be, for example, "A 
certain information item includes keywords of [baseball] 
and [soccer]." and thus is an extraction of a feature of 
the information item. 

Upon receipt of a recommendation request of a 
subject user inputted through the request input section 
1 1 , the user profile learning section 22 learns a subject 
user profile based on the ratings of the information 
items performed by the subject user in the past and 
stored in the rating storing section 33 and the attributes 
of the information items already rated by the subject 
user among all the attributes of the information items 
stored in the attribute storing section 32. The subject 
user profile represents a relationship between the rat- 
ings by the subject user and the attributes of the infor- 
mation items and may be, for example, "A subject user 
gave a high rating in the past to an information item 
including a keyword of [baseball] while a low rating to an 
information item including a keyword of [soccer]". 
Accordingly, the subject user profile represents the 
interests of the subject user in terms of weights of the 
words (keywords) included in the information items. 

The relevance estimating section 23 estimates a 
relevance to the subject user of each of the information 
items based on the subject user profile learned by the 
user profile learning section 22 and the attributes of the 
information items stored in the attribute storing section 
32. Then, the information selecting section 24 uses the 
relevances estimated by the relevance estimating sec- 



tion 23 to select the information item suitable for the 
subject user from among the information items stored in 
the information storing section 31 and outputs it to the 
information display section 12. In this example, the rele- 

5 vance estimating section 23 receives the result of learn- 
ing of the subject user profile that the subject user is 
interested in the information item having the attribute of 
[baseball] but not interested in the information item hav- 
ing the attribute of [soccer], and estimates that the infor- 

w mation item having the keyword of [baseball] is suitable 
for the subject user. The information selecting section 
24 receives the estimation result from the relevance 
estimating section 23, searches for the information 
items having the attribute of [baseball], selects one, 

15 which has not yet been rated by the subject user, from 
the information storing section 31, and outputs it to the 
information display section 12. 

The subject user rates the information item dis- 
played at the information display section 12 to deter- 

20 mine whether it satisfies the subject user's request, and 
inputs a rating thereof through the rating input section 
13. The inputted rating is stored in the rating storing 
section 33. Accordingly the relevance feedback is car- 
ried out to increase the significance of words (keywords) 

25 contained in the information items in which the users 
are interested while reduce the significance of words 
contained in the information items in which the users 
are not interested. This makes it possible to carry out 
the information filtering which can further satisfy the 

30 user's interests. 

Now, an operation of the conventional CBF system 
will be explained with reference to Fig. 2. 

When a recommendation request is inputted from 
the request input section 11, the user profile learning 

35 section 22 learns a subject user profile, representing a 
relationship between the ratings by the subject user and 
the attributes of the information items, based on the rat- 
ing history of the subject user stored in the rating storing 
section 33 and the attributes of the information items 

40 already rated by the subject user among all the 
attributes of the information items stored in the attribute 
storing section 32 (step CI). Subsequently the rele- 
vance estimating section 23 estimates a relevance to 
the subject user of each of the information items based 

45 on the subject user profile learned by the user profile 
learning section 22 and the attributes of the information 
items stored in the attribute storing section 32 (step C2). 
Then, the information selecting section 24 uses the rel- 
evances estimated by the relevance estimating section 

50 23 to select the information item matching with the sub- 
ject user's request (step C3) and outputs it to the infor- 
mation display section 12 (step C4). 

The subject user evaluates the information item dis- 
played at the information display section 12 to deter- 

55 mine whether it matches with the subject user's request, 
and inputs the rating result from the rating input section 
13. The inputted rating data is stored in the rating stor- 
ing section 33. 
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Accordingly, in the CBF system, the contents of the 
information items are featured by the attributes, such as 
the keywords or the frequencies of words contained in 
the respective information items, and the information fil- 
tering can be carried out based on comparison between 
the foregoing featured contents and the interests of the 
user shown in the past and represented by the weights 
of l^eywords or the like. 

The foregoing example is simplified for brevity of 
explanation. 

In practice, the profile learning may be carried out in 
the following manner: 

It is assumed that feature vectors of the information 
items are considered as attribute values of the informa- 
tion items and that rating values of the information items 
are normalized between 1 and -1 , wherein 1 represents 
that the user request is satisfied while -1 represents that 
the user request is not satisfied. In this case, the profile 
is given by the following equation (1): 

^(RxV) 
Profile = (1) 

k 

wherein R represents a normalized rating value 
relative to a k-th information item, and V represents a 
feature vector of the k-th information item. 

Depending on kinds of the information, the accu- 
racy of profile may be increased by using a value, rather 
than an attribute value itself of an information item, 
obtained by dividing a deviation of the attribute value 
from a mean value of the attribute values by the stand- 
ard deviation, as an element value of the feature vector. 

Further, in some cases, the accuracy of profile may 
be increased by dividing the feature vectors by the 
absolute values thereof, respectively so as to normalize 
lengths of the vectors to 1 . 

For the relevance estimation, the Pearson product- 
moment correlation coefficients between the feature 
vectors and the profile may be used. 

Now, the SIF system will be explained using Figs. 3 
and 4. Explanation about structures having the same 
functions as the foregoing CBF system will be omitted. 

Referring to Fig. 3, the SIF system comprises an 
input/output device 1 , a data processing device 4 and a 
storage device 5. 

The input/output device 1 comprises a request 
input section 11, an information display section 12 and 
a rating input section 13. 

The storage device 5 comprises an information 
storing section 31 and a rating storing section 33. The 
information storing section 31 prestores recorded infor- 
mation 30. The rating storing section 33 stores values of 
ratings which have been performed by the subject user 
and the users other than the subject user relative to 



items of information, respectively 

The data processing device 4 comprises a similarity 
deriving section 25, a relevance estimating section 26 
and an information selecting section 24. 

5 Upon receipt of a recommendation request of the 

subject user inputted through the request input section 
1 1, the similarity deriving section 25 derives similarities 
between the users from a rating history of the subject 
user relative to the information items stored in the rating 

10 storing section 33 and rating histories of the users other 
than the subject user relative to the information items 
stored in the rating storing section 33, thereby to search 
out the user/users having similar taste with the subject 
user. 

15 The relevance estimating section 26 estimates a 
relevance to the subject user of each of the information 
items based on the ratings stored in the rating storing 
section 33 and the similarities derived by the similarity 
deriving section 25. Then, the information selecting sec- 

20 tion 24 uses the relevances estimated by the relevance 
estimating section 26 to select the information item 
matching with the subject user's request from among 
the information items stored in the information storing 
section 31 and outputs it to the information display sec- 

25 tion 1 2. Specifically the relevance estimating section 26 
estimates the relevances of the information items from 
the similarities derived by the similarity deriving section 
25 and, based on the estimated relevances, the infor- 
mation selecting section 24 selects the information item 

30 which has been evaluated high by the foregoing similar 
user/users, but not yet evaluated by the subject user, 
and outputs it to the information display section 12. 

The subject user rates the information item dis- 
played at the information display section 12 to deter- 

35 mine whether it matches with the subject user's request, 
and inputs a rating thereof through the rating input sec- 
tion 13. The inputted rating is stored in the rating storing 
section 33. 

The SIF system will be explained using an example. 

40 It is assumed that the information filtering is carried out 
for a subject user who is interested in [baseball] but not 
interested in [soccer]. In the SIF system, a user/users 
interested in [baseball] but not interested in [soccer] like 
the subject user are searched out, and the information 

45 item rated high by the searched-out user/users is 
offered to the subject user. Accordingly if the searched- 
out user/users are interested in [fishing] in addition to 
[baseball], an information item including [fishing] is also 
offered. Naturally it is preferable that the offered infor- 

50 mation item has not yet been seen by the subject user. 
Now, an operation of the conventional SIF system 
will be explained with reference to Fig. 4. 

When a recommendation request is inputted from 
the request input section 11 , the similarity deriving sec- 

55 tion 25 uses the ratings stored in the rating storing sec- 
tion 33 to derive similarities between the users (step 
D1). For example, the foregoing GroupLens and Ringo 
use the Pearson product-moment correlation coeffi- 
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cients as the similarities between the users. Subse- 
quently, the relevance estimating section 26 estimates a 
relevance to the subject user of each of the information 
items based on the ratings stored in the rating storing 
section 33 and the similarities derived by the similarity 
deriving section 25 (step D2). For example, the forego- 
ing GroupLens estimates the relevance using the fol- 
lowing equation (2): 

X (S X R) 
Relevance = ^ (2) 

k 

wherein S represents a similarity between a sub- 
ject user and a k-th user, and R represents a rating 
value by the k-th user. 

Then, the information selecting section 24 uses the 
relevances estimated by the relevance estimating sec- 
tion 26 to select the information item matching with the 
subject user's request (step D3) and outputs it to the 
information display section 12 (step D4). 

The subject user evaluates the information item dis- 
played at the information display section 12 to deter- 
mine whether it matches with the subject user's request, 
and inputs the rating result at the rating input section 13. 
The inputted rating data is stored in the rating storing 
section 33. 

Accordingly, in the SIF system, the information fil- 
tering can be performed by comparing the ratings car- 
ried out by the subject user in the past and the ratings 
carried out by the users other than the subject user in 
the past so as to search out the user/users having simi- 
lar taste with the subject user and selecting the informa- 
tion item which has been evaluated high by the 
foregoing similar user/users, but not yet seen by the 
subject user. 

Now, a preferred embodiment of the present inven- 
tion will be described in detail hereinbelow with refer- 
ence to the drawing. Unless otherwise explained, those 
elements assigned the same reference signs in the fig- 
ures are the same in function with each other through- 
out the foregoing prior art and the preferred 
embodiment of the present invention. Accordingly, 
explanation of those elements will be simplified or omit- 
ted for brevity of disclosure unless further explanation 
seems necessary 

Fig. 5 shows an information filtering device accord- 
ing to the preferred embodiment of the present inven- 
tion. The information filtering device comprises an 
input/output device 1 , a data processing device 6 and a 
storage device 7. 

The input/output device 1 comprises a request 
input section 11, an information display section 12 and 
a rating input section 13. 

The storage device 7 comprises an information 



storing section 31 , an attribute storing section 32 and a 
rating storing section 33. The information storing sec- 
tion 31 prestores recorded information 30. The rating 
storing section 33 stores values of ratings performed by 

5 a subject user and users other than the subject user rel- 
ative to items of information, respectively 

The data processing device 6 comprises an 
attribute extracting section 21, a relevance estimating 
section 27 and an information selecting section 24. 

10 The attribute extracting section 21 extracts 
attributes, such as keywords, which can be features of 
the information items, from the information stored in the 
information storing section 31 and stores them in the 
attribute storing section 32 for the respective informa- 

15 tion items. 

Upon receipt of a recommendation request of the 
subject user inputted through the request input section 
1 1, the relevance estimating section 27 uses the ratings 
of the information items performed by the subject user 

20 and the users other than the subject user (that is, the 
information items C in Fig. 9) and stored in the rating 
storing section 33 so as to derive similarities between 
the users, and then first derives relevances of the infor- 
mation items B in Fig. 9 to perform the information f ilter- 

25 ing thereof. 

Subsequently, the information filtering of the infor- 
mation items D in Fig. 9 is carried out. In the conven- 
tional CBF system, the information filtering of the 
information items D is carried out by learning, at the 

30 user profile learning section 22 in Fig. 1 , the relationship 
between the attributes of the information items (A+C) 
and the ratings thereof performed by the subject user. 
On the other hand, in this embodiment, the result of the 
information filtering relative to the information items B is 

35 also utilized so that a relationship between the attributes 
of the f iltered-in information items B and the relevances 
thereof to the subject user is also learned. In this speci- 
fication, the relevance and the rating have the same 
meaning. However, they are distinguished therebe- 

40 tween such that the rating is inputted by the user while 
the relevance represents a degree of matching to the 
subject user estimated through calculation. Accordingly 
the information filtering of the information items D is car- 
ried out by learning the relationship between the 

45 attributes of the information items among the informa- 
tion items (A+B+C) and the ratings (or relevances) 
thereof. 

The information selecting section 24 outputs the 
result of the information filtering by the relevance esti- 

50 mating section 27 to the information display section 12. 
The subject user evaluates the information item dis- 
played at the information display section 12 to deter- 
mine whether it matches with the subject user's request, 
and inputs a rating thereof through the rating input sec- 

55 tion 1 3. The inputted rating is stored in the rating storing 
section 33. 

Now, referring to Fig. 6, an operation of the informa- 
tion filtering device shown in Fig. 5 will be described. 
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Upon receipt of a recommendation request of the 
subject user from the request input section 1 1 , the rele- 
vance estimating section 27 uses the ratings of the sub- 
ject user and the users other than the subject user 
stored in the rating storing section 33 and the attributes 
of the information items stored in the attribute storing 
section 32 to analyze the relationship between the val- 
ues of the ratings by the users and the attributes of the 
information items, and estimates the relevances to the 
subject user of the information items (step A1). The 
information selecting section 24 uses the relevances 
estimated by the relevance estimating section 27 to 
select the information item matching with the subject 
user's request (step A2) and outputs it to the information 
display section 12 (step A3). The information selecting 
section 24 is referred to as a carrying out arrangement. 

Now, the relevance estimating section 27 will be 
described in more detail. 

In Fig. 7, the relevance estimating section 27 com- 
prises a similarity deriving section 271 , a first relevance 
estimating section 272, a user profile learning section 
273, a second relevance estimating section 274 and a 
relevance unifying section 275. 

Upon receipt of a recommendation request of the 
subject user from the request input section 1 1 , the sim- 
ilarity deriving section 271 uses the ratings of the infor- 
mation items carried out by the subject user and the 
users other than the subject user and stored in the rat- 
ing storing section 33 to derive the similarities between 
the users. 

From the ratings of the users other than the subject 
user stored in the rating storing section 33 and the sim- 
ilarities between the users derived by the similarity 
deriving section 271, the first relevance estimating sec- 
tion 272 estimates the relevances to the subject user of 
the information items B in Fig. 9. 

From the attributes of the information items stored 
in the attribute storing section 32, the ratings of the sub- 
ject user stored in the rating storing section 33 and the 
relevances estimated by the first relevance estimating 
section 272, the user profile learning section 273 learns 
a profile representing the relationship between the 
attributes of the information items and the ratings of the 
subject user and further learns a profile representing 
the relationship between the attributes of the informa- 
tion items and the relevances to the subject user. The 
profile represents the user's interest in terms of weights 
or the like of keywords or the like. 

Further, the profile representing the relationship 
between the attributes of the information items and the 
ratings of the subject user represents a relationship 
about how the subject user rated in the past the infor- 
mation items having certain attributes. For example, it 
represents a relationship between the attribute and the 
subject user, such as "A subject user gave in the past a 
high rating to an information item including a keyword of 
[baseball].". 

On the other hand, the profile representing the rela- 



tionship between the attributes of the information items 
and the relevances to the subject user represents a 
relationship between the attributes and the relevances 
to the subject user, and is obtained by referring to the 

5 attribute storing section 32 based on the relevances of 
the respective information items to the subject user esti- 
mated by the first relevance estimating section 272. 

For example, it is assumed that the subject user 
has given a high rating to an information item including 

10 a keyword of [baseball]. In this case, those information 
items are searched out, which are included in the infor- 
mation items B in Fig. 9 (as described before, the infor- 
mation items B have not been rated by the subject user, 
but rated by the users other than the subject user) and 

15 have been rated by the users other than the subject 
user who have given high ratings to the information 
items including the keyword of [baseball] like the subject 
user. Then, the attributes included in the searched-out 
information items are extracted. For example, if the 

20 users having similar taste with the subject user have 
evaluated high information items including a keyword of 
[fishing] in addition to the information items including the 
keyword of [base ball], a profile is learned which repre- 
sents a relationship between attributes and relevances 

25 to the subject user, such as "A relevance to a subject 
user may be high with respect to an information item 
including a keyword of [fishing] in addition to an informa- 
tion item including a keyword of [baseball].". 

Subsequently, from the attributes of the information 

30 items stored in the attribute storing section 32 and the 
profile of the subject user learned by the user profile 
learning section 273, the second relevance estimating 
section 274 derives the relevances to the subject user of 
the information items D in Fig. 9 (as described before, 

35 the information items D have not been rated by the 
users) and notifies them to the relevance unifying sec- 
tion 275. Specifically, in the foregoing example, among 
the information items D, the relevances are derived to 
be high for the information items including the keyword 

40 of [baseball], and similarly, the relevances are derived to 
be high for the information items including the keyword 
of [fishing]. 

It may also be arranged that the second relevance 
estimating section 274 estimates the relevances of, in 
45 addition to the information items D, the information 
items which have been evaluated by a small number of 
the users. 

The relevance unifying section 275 unifies the rele- 
vances derived by the first relevance estimating section 

50 272 and the second relevance estimating section 274 
for notification to the information selecting section 24. 

Now, referring to Fig. 8, the operation of the infor- 
mation filtering device will be described in more detail 
according to the structure of Fig. 7. 

55 Upon receipt of a recommendation request of the 
subject user from the request input section 1 1 , the sim- 
ilarity deriving section 271 uses the ratings of the sub- 
ject user and the users other than the subject user 
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Stored in the rating storing section 33 to derive the sim- 
ilarities between the users (step A1 1). From the ratings 
of the users other than the subject user stored in the rat- 
ing storing section 33 and the similarities between the 
users derived by the similarity deriving section 271 , the 5 
first relevance estimating section 272 estimates the rel- 
evances to the subject user of the respective informa- 
tion items (step A12). 

From the attributes of the information items stored 
in the attribute storing section 32, the ratings of the sub- 10 
ject user stored in the rating storing section 33 and the 
relevances estimated by the first relevance estimating 
section 272, the user profile learning section 273 learns 
the profile representing the relationship between the 
attributes of the information items and the ratings of the 15 
subject user and the profile representing the relation- 
ship between the attributes of the information items and 
the relevances to the subject user (step A13). 

From the attributes of the information items stored 
in the attribute storing section 32 and the profiles of the 20 
subject user learned by the user profile learning section 
273, the second relevance estimating section 274 esti- 
mates the relevances of the information items D relative 
to the subject user and notifies them to the relevance 
unifying section 275 (step A14). In this case, in addition 25 
to the information items D, the relevances to the subject 
user may be estimated for the information items which 
have been rated by a small number of the users. 

The relevance unifying section 275 unifies the rele- 
vances from the first relevance estimating section 272 30 
and the second relevance estimating section 274 and 
notifies the unified relevance to the information select- 
ing section 24. The relevance notifying section 275 uni- 
fies the relevances when the first and second relevance 
estimating section 272, 274 estimate different rele- 35 
vances for the same information item (step A15). 

The information selecting section 24 uses the uni- 
fied relevances from the relevance unifying section to 
select the information item matching with the subject 
user's request (step A2) and outputs it to the information 40 
display section 12 (step A3). 

The similarity between the users may be the same 
as in the conventional SIF system. For example, a vec- 
tor having, as elements, values of ratings performed by 
a user relative to information items may be used as a 45 
rating vector, and a Pearson product-moment correla- 
tion coefficient between rating vectors of the users may 
be used as a similarity between the users. 

The profile learning method may be the same as in 
the conventional CBF system. However, although the so 
relationship between the attributes of the information 
items and the ratings thereof is learned in the conven- 
tional CBF system, the relationship between the 
attributes of the information items and the rating values 
thereof and the relationship between the attributes of 55 
the information items and the relevances estimated by 
the first relevance estimating section are learned in this 
embodiment. For example, the profile can be derived by 



the following equation (3): 

Profile = ^ (3) 

k 

wherein R represents a normalized rating value 
or estimated relevance relative to a k-th information 
item, and V represents a feature vector of the k-th infor- 
mation item, and wherein "rating value or estimated rel- 
evance" represents that a rating value is used when 
there is rating, while an estimated relevance is used 
when there is no rating. 

The estimating method of the relevance used by the 
second relevance estimating section may be the same 
as in the conventional CBF system. For example, a 
Pearson product-moment correlation coefficient 
between a feature vector of an information item and a 
profile may be used. 

The second relevance estimating section estimates 
the relevance of the information item whose relevance 
can not be estimated by the first relevance estimating 
section or whose relevance is low in accuracy even if it 
can be estimated. For example, as described before, 
the relevance is estimated for the information item 
which has not been rated by any of the users or which 
has been rated by a small number of the users. 

As described before, when the first and second rel- 
evance estimating section 272 and 274 estimate differ- 
ent relevances for the same information item, the 
relevance unifying section 275 unifies the relevances. In 
this case, the relevance unifying section 275 may select 
the higher relevance or the lower relevance, use the 
mean value of the two relevances, select one of the two 
relevances with higher reliability or carry out others, 
depending on a field of the information so as to maxi- 
mize the reliability of the unified relevance. 

Further, a storage medium, such as a CD-ROM or 
a flexible disk, may store a program which is executable 
by a computer so that the operation of the information 
filtering device according to this embodiment is realized 
when the computer executes the stored program. 

Referring to Fig. 9, the description will be pro- 
ceeded as regards the effects of the information filtering 
device. Since the SIF system has the foregoing feature, 
the SIF system is used with respect to the information 
items B. On the other hand, with respect to the informa- 
tion items D, since only the CBF system deals with the 
information filtering of such information items, the CBF 
system is used. However, as described before, the CBF 
system has the problem of being poor at dealing with 
the unknown word. Such a problem is coped with by 
feeding back the filtering result of the SIF system to the 
profile of the CBF system. Accordingly the present 
invention can achieve the information filtering even rela- 
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tive to the information items D and is better at dealing 
with the unknown word than the conventional CBF sys- 
tem by using the ratings of the users other than the sub- 
ject user. 

The present invention will be understood more fully 
from the detailed description given hereinbelow, taken 
in conjunction with the accompanying drawings. 

According to the foregoing preferred embodiment, 
the following effects can be achieved: 

The first effect is to lessen the problem that it is nec- 
essary for the user to judge in the conventional CBF 
system whether the information is valuable or not, by 
reading it. The reason is that the personal evaluation of 
the information is introduced into the relevance estima- 
tion. 

The second effect is to lessen the problem that the 
filtering accuracy is low in the conventional CBF system 
relative to an information item having an unknown word 
or an unknown attribute value. The reason is that even if 
the word or attribute value is not included in the informa- 
tion item rated by the subject user, it can be used for the 
relevance estimation if it is included in an information 
item rated by the other user. 

The third effect is to solve the problems of the con- 
ventional SIF system that only the information item that 
has been recommended or evaluated by the users can 
be obtained and that the filtering accuracy is low unless 
a certain number of ratings have been collected. The 
reason is that the relevance estimation is carried out 
using also the attributes of the information items. 

While the present invention has been described in 
terms of the preferred embodiment, the invention is not 
to be limited thereto, but can be embodied in various 
ways without departing from the principle of the inven- 
tion. 

Claims 

1. An information filtering method including the steps 
of obtaining attributes included in information items 
and obtaining ratings relative to said information 
items performed by users including a subject user 
and other users, characterized by further compris- 
ing the steps of: 

estimating relevances to said subject user of 
the information items not rated by said subject 
user by the use of the ratings relative to the 
information items rated by said subject user, of 
the attributes of the information items rated by 
said subject user, of the ratings relative to the 
information items rated by said other users, 
and of the attributes of the information items 
rated by said other users; and 
carrying out recommendation or filtering-in for 
said subject user as regards said information 
items by the use of said relevances. 



2. An information filtering method as claimed in claim 
1 , wherein the estimating step comprises the steps 
of: 

5 deriving a first relationship between the ratings 

relative to the information items rated by said 
subject user and the attributes of the informa- 
tion items rated by said subject user; 
deriving a second relationship between the rat- 

10 ings relative to the information items rated by 

said other users and the attributes of the infor- 
mation items rated by said other users; and 
estimating the relevances to said subject user 
of the information items not rated by said sub- 

15 ject user by the use of said first and said sec- 

ond relationships. 

3. An information filtering method as claimed in claim 
1 , wherein the estimating step comprises the steps 

20 Of: 

deriving similarities between said subject user 
and said other users by the use of the ratings 
relative to the information items rated by said 
25 subject user and of the ratings relative to the 

information items rated by said other users; 
estimating, based on said similarities, a first rel- 
evance to said subject user of the information 
items not rated by said subject user among the 
30 information items rated by said other users; 

deriving a first relationship between the ratings 
relative to the information items rated by said 
subject user and the attributes of the informa- 
tion items rated by said subject user; 
35 deriving a second relationship between said 

first relevance and the attributes of the informa- 
tion items not rated by the subject user among 
the information items rated by said other users; 
estimating, based on said first and said second 
40 relationships, a second relevance to said sub- 

ject user of the information items not rated by 
any of said subject user and said other users; 
and 

unifying said first and said second relevances 
45 into the first-mentioned relevances, respec- 

tively. 

4. An information filtering method as claimed in claim 
1, wherein the first-mentioned obtaining step com- 

50 prises the steps of: 

extracting said attributes from said information 
items; and 

storing said attributes in an attribute storing 
55 section, 

the second-mentioned obtaining step compris- 
ing: 

receiving said ratings from said users; and 
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storing said ratings in a rating storing section. 

5. An information filtering device including first obtain- 
ing means (21 , 32) for obtaining attributes included 

in information items and second obtaining means s 
(13, 33) for obtaining ratings relative to said infor- 
mation items performed by users including a sub- 
ject user and other users, characterized by further 
comprising: 

10 

estimating means (27) connected to said first 
and said second obtaining means for estimat- 
ing relevances to said subject user of the infor- 
mation items not rated by said subject user by 
the use of the ratings relative to the information is 
items rated by said subject user, of the 
attributes of the information items rated by said 
subject user, of the ratings relative to the infor- 
mation items rated by said other users, and of 
the attributes of the information items rated by 20 
said other users; and 

carrying out means (24) connected to said esti- 
mating means for carrying out recommenda- 
tion or filtering-in for said subject user as 
regards said information items by the use of 25 
said relevances. 

6. An information filtering device as claimed in claim 5, 
wherein the estimating means comprises: 

30 

first deriving means connected to said first and 
said second obtaining means for deriving a first 
relationship between the ratings relative to the 
information items rated by said subject user 
and the attributes of the information items rated 35 
by said subject user; 

second deriving means connected to said first 
and said second obtaining means for deriving a 
second relationship between the ratings rela- 
tive to the information items rated by said other 40 
users and the attributes of the information 
items rated by said other users; and 
local estimating means connected to said first 
and said second deriving means and said car- 
rying out means for estimating the relevances 45 
to said subject user of the information items not 
rated by said subject user by the use of said 
first and said second relationships. 

7. An information filtering device as claimed in claim 5, so 
wherein the estimating means comprises: 

zeroth deriving means (271) connected to said 
first and said second obtaining means for deriv- 
ing similarities between said subject user and ss 
said other users by the use of the ratings rela- 
tive to the information items rated by said sub- 
ject user an d of the ratings relative to the 



information items rated by said other users; 
first local estimating means (272) connected to 
said zeroth deriving means for estimating, 
based on said similarities, a first relevance to 
said subject user of the information items not 
rated by said subject user among the informa- 
tion items rated by said other users; 
first deriving means (273) connected to said 
first and said second obtaining means for deriv- 
ing a first relationship between the ratings rela- 
tive to the information items rated by said 
subject user and the attributes of the informa- 
tion items rated by said subject user; 
second deriving means (273) connected to 
said first and said second obtaining means for 
deriving a second relationship between said 
first relevance and the attributes of the informa- 
tion items not rated by the subject user among 
the information items rated by said other users; 
second local estimating means (274) con- 
nected to said first and said second deriving 
means for estimating, based on said first and 
said second relationships, a second relevance 
to said subject user of the information items not 
rated by any of said subject user and said other 
users; and 

unifying means (275) connected to said first 
and said second local estimating means for 
unifying said first and said second relevances 
into the first-mentioned relevances, respec- 
tively. 

8. An information filtering device as claimed in claim 5, 
wherein said first obtaining means comprises: 

extracting means (21) for extracting said 
attributes from said information items; and 
an attribute storing section (32) connected to 
said extracting means and said estimating 
means for storing said attributes therein, 
said second obtaining means comprising: 
receiving means (13) for receiving said ratings 
from said users; and 

a rating storing section (33) connected to said 
receiving means and said estimating means for 
storing said ratings therein. 

9. A storage medium storing an information filtering 
program which is executable by a computer, char- 
acterized in that said program allowing said compu- 
ter to execute the steps of: 

extracting attributes included in information 
items and storing said attributes in a storage 
device of the computer, and storing ratings rel- 
ative to the information items performed by 
users including a subject user and other users 
in said storage device; 
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using a relationship between the ratings rela- 
tive to the information items rated by the sub- 
ject user and the attributes thereof and a 
relationship between the ratings relative to the 
information items rated by the other users and 5 
the attributes thereof so as to estimate rele- 
vances to the subject user of the information 
items not rated by the subject user; and 
using said relevances to carry out recommen- 
dation or filtering-in of the information item 10 
matching with the subject user. 

10. A storage medium storing an information filtering 
program which is executable by a computer, char- 
acterized in that said program allowing said compu- 15 
ter to function as: 

attribute extracting means for extracting 
attributes included in information items; 
attribute storing means for storing said 20 
extracted attributes; 

rating storing means for receiving inputs of rat- 
ing values relative to the information items from 
users including a subject user and other users, 
and storing said received rating values; 25 
similarity deriving means for deriving, by refer- 
ring to said rating storing means, similarities 
between the subject user and the other users 
based on the rating values inputted by the sub- 
ject user relative to the information items and 30 
the rating values inputted by the other users 
relative to the information items; 
first relevance estimating means for estimating, 
based on said derived similarities, first rele- 
vances to the subject user of the information 35 
items not rated by the subject user among the 
information items rated by the other users; a 
user profile learning means for learning, by 
referring to said attribute storing means and 
said rating storing means, a relationship 40 
between the attributes included in the informa- 
tion items rated by the subject user and the rat- 
ings thereof, and further learning, based on 
said estimated first relevances, a relationship 
between the attributes included in the informa- 45 
tion items not rated by the subject user among 
the information items rated by the other users 
and the relevances thereof to the subject user; 
second relevance estimating means for esti- 
mating, based on said relationship between the so 
attributes and the ratings and said relationship 
between the attributes and the relevances, sec- 
ond relevances to the subject user of the infor- 
mation items not rated by any of the subject 
user and the other users; and 55 
relevance unifying means for unifying said first 
and second relevances to carry out recommen- 
dation or filtering-in of the information item 



matching with the subject user. 



11 



EP 0 860 785 A1 



1 

d.... 

INPUT/OUTPUT 
DEVICE 



11 



REQUEST 

INPUT 
SECTION 



12 



INFORMATION 
DISPLAY 
SECTION 



13 



RATING 
INPUT 
SECTION 



DATA 

PROCESSING 
DEVICE 



ATTRIBUTE 
EXTRACTING 
SECTION 



T 

21 



22 



USER PROFILE 
LEARNING 
SECTION 



23 



RELEVANCE 
ESTIMATING 
SECTION 



24 



INFORMATION 
SELECTING 
SECTION 



RECORDED 
INFORMATION 



STORAGE 
DEVICE 



31 



INFORMATION 
STORING 
SECTION 



32 



ATTRIBUTE 
STORING 
SECTION 



33 



RATING 
STORING 
SECTION 



30 



FIG. 1 

PRIOR ART 



12 



EP 0 860 785 A1 



RECOMMENDATION \ 
REQUEST / 







LEARN USER PROFILE 






ESTIMATE RELEVANCE 






SELECT INFORMATION 






DISPLAY INFORMATION 



FIG. 2 

PRIOR ART 



13 



EP 0 860 785 A1 



1 

jd.... 

INPUT/OUTPUT 
DEVICE 

11 



REQUEST 

INPUT 
SECTION 



12 



INFORMATION 
DISPLAY 
SECTION 



13 



RATING 
INPUT 
SECTION 



DATA 

PROCESSING 
DEVICE 



25 



SIMILARITY 
DERIVING 
SECTION 



26 



RELEVANCE 
ESTIMATING 
SECTION 



24 



INFORMATION 
SELECTING 
SECTION 



RECORDED 
INFORMATION 



STORAGE 
DEVICE 



30 



33 



RATING 
STORING 
SECTION 



31 



INFORMATION 
STORING 
SECTION 



FIG. 3 

PRIOR ART 



14 



EP 0 860 785 A1 



RECOMMENDATION 
REQUEST 



I DERIVE SIMILARITIES 
BETWEEN USERS 



ESTIMATE RELEVANCE 



SELECT INFORMATION 

1 



DISPLAY INFORMATION 



FIG. 4 

PRIOR ART 



15 



EP 0 860 785 A1 



1 



INPUT/OUTPUT 
DEVICE 



11 



REQUEST 

INPUT 
SECTION 



12 



INFORMATION 
DISPLAY 
SECTION 



13 



RATING 
INPUT 
SECTION 



DATA 

PROCESSING 
DEVICE 



21 



ATTRIBUTE 
EXTRACTING 
SECTION 



27 



RELEVANCE 
ESTIMATING 
SECTION 



24 



INFORMATION 
SELECTING 
SECTION 



RECORDED 
INFORMATION 



STORAGE 
DEVICE 



-30 

7 



31 



INFORMATION 
STORING 
SECTION 



32 



ATTRIBUTE 
STORING 
SECTION 



33 



RATING 
STORING 
SECTION 



FIG. 5 



16 



EP 0 860 785 A1 



RECOMMENDATION 
REQUEST 




SELECT INFORMATION 



DISPLAY INFORMATION 



END 



FIG. 6 



17 



EP 0 860 785 A1 



27 



11 

REQUEST 

INPUT 
SECTION 



RELEVANCE 
ESTIMATING 
SECTION 



271 



SIMILARITY 


DERIVING SECTION 




272 






FIRST RELEVANCE 


ESTIMATING SECTION 




273 






USER PROFILE 


LEARNING 


SECTION 




274 




y 


SECOND RELEVANCE 


ESTIMATING SECTION 




275 






RELEVANCE 


UNIFYING 


SECTION 










INFORMATION 


SELECTING SECTION 



32 



ATTRIBUTE 
STORING 
SECTION 



33 



RATING 
STORING 
SECTION 



24 



FIG. 7 



18 



EP 0 860 785 A1 



RECOMMENDATION 
REQUEST 



1 




DERIVE SIMILARITIES 
BETWEEN USERS 







ESTIMATE RELEVANCE 






UNIFY RELEVANCES 






SELECT INFORMATION 






DISPLAY INFORMATION 







END 



All 




A12 



A13 



A14 



A15 



A2 



A3 



FIG. 8 



19 



EP 0 860 785 A1 



INFORMATION 
ITEMS RATED 
BY SUBJECT 
USER 



INFORMATION 
ITEMS RATED 
BY OTHER THAN 
SUBJECT 
USER 



D : INFORMATION ITEMS 
NOT YET RATED 



B 



FIG. 9 



20 



EP 0 860 785 A1 



European Patent 
Office 



EUROPEAN SEARCH REPORT 



Application Number 

EP 98 10 3190 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 



Citation of document with indication, where appropriate, 
of relevant passages 



Relevant 
to claim 



CLASSIFICATION OF THE 
APPLICATION (lnt.CI.6) 



A,0 



EP 0 751 471 A (MASSACHUSETTS INST 
TECHNOLOGY) 2 January 1997 

* page 12, line 42 - page 14, line 50; 
figure 2 * 

* abstract * 

WO 95 29452 A (APPLE COMPUTER ;ROSE DANIEL 
E (US); BORNSTEIN JEREMY J (US); TIENE) 2 
November 1995 

* page 8, line 23 - page 12, line 7; 
figures 3-6 * 

GOLDBERG D ET AL: "USING COLLABORATIVE 
FILTERING TO WEAVE AN INFORMATION 
TAPESTRY" 

COMMUNICATIONS OF THE ASSOCIATION FOR 
COMPUTING MACHINERY, 
vol. 35, no. 12, 1 December 1992, 
pages 61-70, XP002000952 

* page 61, line 1 - page 64, right-hand 
column, line 37; figures 1,2 * 

RESNICK P ET AL: "GROUPLENS: AN OPEN 
ARCHITECTURE FOR COLLABORATIVE FILTERING 
OF NETNEWS" 

PROCEEDINGS OF CONFERENCE ON 
COMPUTER-SUPPORTED COOPERATIVE WORK, 7-10 
OCT. 1990, LOS ANGELES, 

22 October 1994, 
pages 175-186, XP000601284 

* page 177, right-hand column, line 25 - 
page 181, right-hand column, line 12 * 



1,5,9,10 



G06F17/30 



1,5,9,10 



1,5,9,10 



TECHNICAL FIELDS 
SEARCHED (lnt.CI.6) 



1,5,9,10 



G06F 



The present search report has been drawn up for all claims 



Place of seait:h 

BERLIN 



Date of complaton of the seatch 

12 May 1998 



Examiner 

Deane, E 



CATEGORY OF CITED DOCUMENTS 

X ; particutarly relevant if taken alone 

Y : particularly relevant if combined with another 

document of the same category 
A : tGchnotogical background 
O : non-written disclosure 
P : intermediate document 



T : theory or principle underlying the invention 
E : earlier patent document, but published on, or 

after the filing date 
D : document cited in the application 
L : document cited for other reasons 

& : memtjer of the same patent family, corresponding 
document 



21 



